"அணில்" meaning in All languages combined

See அணில் on Wiktionary

Noun [Tamil]

IPA: /aɳil/ Audio: Ta-அணில்.ogg
Etymology: Cognate with Kannada ಅಳಿಲು (aḷilu) and Malayalam അണിൽ (aṇil). Etymology templates: {{cog|kn|ಅಳಿಲು}} Kannada ಅಳಿಲು (aḷilu), {{cog|ml|അണിൽ}} Malayalam അണിൽ (aṇil) Head templates: {{ta-noun|pl=அணிற்கள்}} அணில் • (aṇil) (plural அணிற்கள்) Inflection templates: {{ta-decl|அணில்|அணிற்கள்|அணிலே|அணிலு}} Forms: aṇil [romanization], அணிற்கள் [plural], no-table-tags [table-tags], அணில் [nominative, singular], அணிற்கள் [nominative, plural], அணிலே [singular, vocative], அணிற்களே [plural, vocative], அணிலை [accusative, singular], அணிற்களை [accusative, plural], அணிலுக்கு [dative, singular], அணிற்களுக்கு [dative, plural], அணிலுக்காக [benefactive, singular], அணிற்களுக்காக [benefactive, plural], அணிலுடைய [error-unrecognized-form, singular], அணிற்களுடைய [error-unrecognized-form, plural], அணிலின் [error-unrecognized-form, singular], அணிற்களின் [error-unrecognized-form, plural], அணிலில் [error-unrecognized-form, singular], அணிற்களில் [error-unrecognized-form, plural], அணிலிடம் [error-unrecognized-form, singular], அணிற்களிடம் [error-unrecognized-form, plural], அணிலோடு [error-unrecognized-form, singular], அணிற்களோடு [error-unrecognized-form, plural], அணிலுடன் [error-unrecognized-form, singular], அணிற்களுடன் [error-unrecognized-form, plural], அணிலால் [instrumental, singular], அணிற்களால் [instrumental, plural], அணிலிலிருந்து [ablative, singular], அணிற்களிலிருந்து [ablative, plural], அணிற்பிள்ளை [alternative], அணில்பிள்ளை [alternative]
  1. Indian Palm Squirrel (Funambulus palmarum)
    Sense id: en-அணில்-ta-noun-DRU6sYeg Categories (other): Pages with 1 entry, Pages with entries, Tamil entries with incorrect language header, Tamil terms with redundant script codes, Animals, Squirrels Disambiguation of Pages with 1 entry: 62 27 11 Disambiguation of Pages with entries: 66 25 9 Disambiguation of Tamil entries with incorrect language header: 63 21 16 Disambiguation of Tamil terms with redundant script codes: 45 26 29 Disambiguation of Animals: 76 19 5 Disambiguation of Squirrels: 68 30 2
  2. (by extension) squirrel (any of the rodents of the family Sciuridae) Tags: broadly
    Sense id: en-அணில்-ta-noun-~FJFD~XO
  3. (social media, chiefly derogatory) a fan of actor Vijay Tags: derogatory
    Sense id: en-அணில்-ta-noun-mGKGXvP5 Categories (other): Social media
The following are not (yet) sense-disambiguated
Derived forms: அணிற்பிள்ளை (aṇiṟpiḷḷai)
Categories (other): People Disambiguation of People: 0 0 0
{
  "categories": [
    {
      "_dis": "0 0 0",
      "kind": "other",
      "langcode": "ta",
      "name": "People",
      "orig": "ta:People",
      "parents": [],
      "source": "w+disamb"
    }
  ],
  "derived": [
    {
      "_dis1": "0 0 0",
      "roman": "aṇiṟpiḷḷai",
      "word": "அணிற்பிள்ளை"
    }
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "kn",
        "2": "ಅಳಿಲು"
      },
      "expansion": "Kannada ಅಳಿಲು (aḷilu)",
      "name": "cog"
    },
    {
      "args": {
        "1": "ml",
        "2": "അണിൽ"
      },
      "expansion": "Malayalam അണിൽ (aṇil)",
      "name": "cog"
    }
  ],
  "etymology_text": "Cognate with Kannada ಅಳಿಲು (aḷilu) and Malayalam അണിൽ (aṇil).",
  "forms": [
    {
      "form": "aṇil",
      "tags": [
        "romanization"
      ]
    },
    {
      "form": "அணிற்கள்",
      "tags": [
        "plural"
      ]
    },
    {
      "form": "no-table-tags",
      "source": "declension",
      "tags": [
        "table-tags"
      ]
    },
    {
      "form": "ta-decl",
      "source": "declension",
      "tags": [
        "inflection-template"
      ]
    },
    {
      "form": "அணில்",
      "roman": "aṇil",
      "source": "declension",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "அணிற்கள்",
      "roman": "aṇiṟkaḷ",
      "source": "declension",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "அணிலே",
      "roman": "aṇilē",
      "source": "declension",
      "tags": [
        "singular",
        "vocative"
      ]
    },
    {
      "form": "அணிற்களே",
      "roman": "aṇiṟkaḷē",
      "source": "declension",
      "tags": [
        "plural",
        "vocative"
      ]
    },
    {
      "form": "அணிலை",
      "roman": "aṇilai",
      "source": "declension",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "அணிற்களை",
      "roman": "aṇiṟkaḷai",
      "source": "declension",
      "tags": [
        "accusative",
        "plural"
      ]
    },
    {
      "form": "அணிலுக்கு",
      "roman": "aṇilukku",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "அணிற்களுக்கு",
      "roman": "aṇiṟkaḷukku",
      "source": "declension",
      "tags": [
        "dative",
        "plural"
      ]
    },
    {
      "form": "அணிலுக்காக",
      "roman": "aṇilukkāka",
      "source": "declension",
      "tags": [
        "benefactive",
        "singular"
      ]
    },
    {
      "form": "அணிற்களுக்காக",
      "roman": "aṇiṟkaḷukkāka",
      "source": "declension",
      "tags": [
        "benefactive",
        "plural"
      ]
    },
    {
      "form": "அணிலுடைய",
      "roman": "aṇiluṭaiya",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களுடைய",
      "roman": "aṇiṟkaḷuṭaiya",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலின்",
      "roman": "aṇiliṉ",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களின்",
      "roman": "aṇiṟkaḷiṉ",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலில்",
      "roman": "aṇilil",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களில்",
      "roman": "aṇiṟkaḷil",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலிடம்",
      "roman": "aṇiliṭam",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களிடம்",
      "roman": "aṇiṟkaḷiṭam",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலோடு",
      "roman": "aṇilōṭu",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களோடு",
      "roman": "aṇiṟkaḷōṭu",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலுடன்",
      "roman": "aṇiluṭaṉ",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களுடன்",
      "roman": "aṇiṟkaḷuṭaṉ",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலால்",
      "roman": "aṇilāl",
      "source": "declension",
      "tags": [
        "instrumental",
        "singular"
      ]
    },
    {
      "form": "அணிற்களால்",
      "roman": "aṇiṟkaḷāl",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    },
    {
      "form": "அணிலிலிருந்து",
      "roman": "aṇililiruntu",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "அணிற்களிலிருந்து",
      "roman": "aṇiṟkaḷiliruntu",
      "source": "declension",
      "tags": [
        "ablative",
        "plural"
      ]
    },
    {
      "form": "அணிற்பிள்ளை",
      "roman": "aṇiṟpiḷḷai",
      "tags": [
        "alternative"
      ]
    },
    {
      "form": "அணில்பிள்ளை",
      "roman": "aṇilpiḷḷai",
      "tags": [
        "alternative"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "pl": "அணிற்கள்"
      },
      "expansion": "அணில் • (aṇil) (plural அணிற்கள்)",
      "name": "ta-noun"
    }
  ],
  "inflection_templates": [
    {
      "args": {
        "1": "அணில்",
        "2": "அணிற்கள்",
        "3": "அணிலே",
        "4": "அணிலு"
      },
      "name": "ta-decl"
    }
  ],
  "lang": "Tamil",
  "lang_code": "ta",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "_dis": "62 27 11",
          "kind": "other",
          "name": "Pages with 1 entry",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "66 25 9",
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "63 21 16",
          "kind": "other",
          "name": "Tamil entries with incorrect language header",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "45 26 29",
          "kind": "other",
          "name": "Tamil terms with redundant script codes",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "76 19 5",
          "kind": "other",
          "langcode": "ta",
          "name": "Animals",
          "orig": "ta:Animals",
          "parents": [],
          "source": "w+disamb"
        },
        {
          "_dis": "68 30 2",
          "kind": "other",
          "langcode": "ta",
          "name": "Squirrels",
          "orig": "ta:Squirrels",
          "parents": [],
          "source": "w+disamb"
        }
      ],
      "glosses": [
        "Indian Palm Squirrel (Funambulus palmarum)"
      ],
      "id": "en-அணில்-ta-noun-DRU6sYeg"
    },
    {
      "glosses": [
        "squirrel (any of the rodents of the family Sciuridae)"
      ],
      "id": "en-அணில்-ta-noun-~FJFD~XO",
      "links": [
        [
          "squirrel",
          "squirrel#English:_Q9482"
        ],
        [
          "rodent",
          "rodent"
        ],
        [
          "Sciuridae",
          "Sciuridae#Translingual"
        ]
      ],
      "raw_glosses": [
        "(by extension) squirrel (any of the rodents of the family Sciuridae)"
      ],
      "tags": [
        "broadly"
      ]
    },
    {
      "categories": [
        {
          "kind": "other",
          "langcode": "ta",
          "name": "Social media",
          "orig": "ta:Social media",
          "parents": [],
          "source": "w"
        }
      ],
      "examples": [
        {
          "text": "Coordinate term: ஆமை (āmai)"
        }
      ],
      "glosses": [
        "a fan of actor Vijay"
      ],
      "id": "en-அணில்-ta-noun-mGKGXvP5",
      "links": [
        [
          "social media",
          "social media"
        ],
        [
          "derogatory",
          "derogatory"
        ],
        [
          "fan",
          "fan"
        ]
      ],
      "qualifier": "social media",
      "raw_glosses": [
        "(social media, chiefly derogatory) a fan of actor Vijay"
      ],
      "tags": [
        "derogatory"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/aɳil/"
    },
    {
      "audio": "Ta-அணில்.ogg",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/f/f1/Ta-%E0%AE%85%E0%AE%A3%E0%AE%BF%E0%AE%B2%E0%AF%8D.ogg/Ta-%E0%AE%85%E0%AE%A3%E0%AE%BF%E0%AE%B2%E0%AF%8D.ogg.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/f/f1/Ta-%E0%AE%85%E0%AE%A3%E0%AE%BF%E0%AE%B2%E0%AF%8D.ogg"
    }
  ],
  "word": "அணில்"
}
{
  "categories": [
    "Pages with 1 entry",
    "Pages with entries",
    "Tamil entries with incorrect language header",
    "Tamil irregular nouns",
    "Tamil l-stem nouns",
    "Tamil lemmas",
    "Tamil nouns",
    "Tamil terms with IPA pronunciation",
    "Tamil terms with redundant script codes",
    "ta:Animals",
    "ta:People",
    "ta:Squirrels"
  ],
  "derived": [
    {
      "roman": "aṇiṟpiḷḷai",
      "word": "அணிற்பிள்ளை"
    }
  ],
  "etymology_templates": [
    {
      "args": {
        "1": "kn",
        "2": "ಅಳಿಲು"
      },
      "expansion": "Kannada ಅಳಿಲು (aḷilu)",
      "name": "cog"
    },
    {
      "args": {
        "1": "ml",
        "2": "അണിൽ"
      },
      "expansion": "Malayalam അണിൽ (aṇil)",
      "name": "cog"
    }
  ],
  "etymology_text": "Cognate with Kannada ಅಳಿಲು (aḷilu) and Malayalam അണിൽ (aṇil).",
  "forms": [
    {
      "form": "aṇil",
      "tags": [
        "romanization"
      ]
    },
    {
      "form": "அணிற்கள்",
      "tags": [
        "plural"
      ]
    },
    {
      "form": "no-table-tags",
      "source": "declension",
      "tags": [
        "table-tags"
      ]
    },
    {
      "form": "ta-decl",
      "source": "declension",
      "tags": [
        "inflection-template"
      ]
    },
    {
      "form": "அணில்",
      "roman": "aṇil",
      "source": "declension",
      "tags": [
        "nominative",
        "singular"
      ]
    },
    {
      "form": "அணிற்கள்",
      "roman": "aṇiṟkaḷ",
      "source": "declension",
      "tags": [
        "nominative",
        "plural"
      ]
    },
    {
      "form": "அணிலே",
      "roman": "aṇilē",
      "source": "declension",
      "tags": [
        "singular",
        "vocative"
      ]
    },
    {
      "form": "அணிற்களே",
      "roman": "aṇiṟkaḷē",
      "source": "declension",
      "tags": [
        "plural",
        "vocative"
      ]
    },
    {
      "form": "அணிலை",
      "roman": "aṇilai",
      "source": "declension",
      "tags": [
        "accusative",
        "singular"
      ]
    },
    {
      "form": "அணிற்களை",
      "roman": "aṇiṟkaḷai",
      "source": "declension",
      "tags": [
        "accusative",
        "plural"
      ]
    },
    {
      "form": "அணிலுக்கு",
      "roman": "aṇilukku",
      "source": "declension",
      "tags": [
        "dative",
        "singular"
      ]
    },
    {
      "form": "அணிற்களுக்கு",
      "roman": "aṇiṟkaḷukku",
      "source": "declension",
      "tags": [
        "dative",
        "plural"
      ]
    },
    {
      "form": "அணிலுக்காக",
      "roman": "aṇilukkāka",
      "source": "declension",
      "tags": [
        "benefactive",
        "singular"
      ]
    },
    {
      "form": "அணிற்களுக்காக",
      "roman": "aṇiṟkaḷukkāka",
      "source": "declension",
      "tags": [
        "benefactive",
        "plural"
      ]
    },
    {
      "form": "அணிலுடைய",
      "roman": "aṇiluṭaiya",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களுடைய",
      "roman": "aṇiṟkaḷuṭaiya",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலின்",
      "roman": "aṇiliṉ",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களின்",
      "roman": "aṇiṟkaḷiṉ",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலில்",
      "roman": "aṇilil",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களில்",
      "roman": "aṇiṟkaḷil",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலிடம்",
      "roman": "aṇiliṭam",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களிடம்",
      "roman": "aṇiṟkaḷiṭam",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலோடு",
      "roman": "aṇilōṭu",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களோடு",
      "roman": "aṇiṟkaḷōṭu",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலுடன்",
      "roman": "aṇiluṭaṉ",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "singular"
      ]
    },
    {
      "form": "அணிற்களுடன்",
      "roman": "aṇiṟkaḷuṭaṉ",
      "source": "declension",
      "tags": [
        "error-unrecognized-form",
        "plural"
      ]
    },
    {
      "form": "அணிலால்",
      "roman": "aṇilāl",
      "source": "declension",
      "tags": [
        "instrumental",
        "singular"
      ]
    },
    {
      "form": "அணிற்களால்",
      "roman": "aṇiṟkaḷāl",
      "source": "declension",
      "tags": [
        "instrumental",
        "plural"
      ]
    },
    {
      "form": "அணிலிலிருந்து",
      "roman": "aṇililiruntu",
      "source": "declension",
      "tags": [
        "ablative",
        "singular"
      ]
    },
    {
      "form": "அணிற்களிலிருந்து",
      "roman": "aṇiṟkaḷiliruntu",
      "source": "declension",
      "tags": [
        "ablative",
        "plural"
      ]
    },
    {
      "form": "அணிற்பிள்ளை",
      "roman": "aṇiṟpiḷḷai",
      "tags": [
        "alternative"
      ]
    },
    {
      "form": "அணில்பிள்ளை",
      "roman": "aṇilpiḷḷai",
      "tags": [
        "alternative"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "pl": "அணிற்கள்"
      },
      "expansion": "அணில் • (aṇil) (plural அணிற்கள்)",
      "name": "ta-noun"
    }
  ],
  "inflection_templates": [
    {
      "args": {
        "1": "அணில்",
        "2": "அணிற்கள்",
        "3": "அணிலே",
        "4": "அணிலு"
      },
      "name": "ta-decl"
    }
  ],
  "lang": "Tamil",
  "lang_code": "ta",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "Entries missing English vernacular names of taxa",
        "Entries using missing taxonomic name (species)"
      ],
      "glosses": [
        "Indian Palm Squirrel (Funambulus palmarum)"
      ]
    },
    {
      "glosses": [
        "squirrel (any of the rodents of the family Sciuridae)"
      ],
      "links": [
        [
          "squirrel",
          "squirrel#English:_Q9482"
        ],
        [
          "rodent",
          "rodent"
        ],
        [
          "Sciuridae",
          "Sciuridae#Translingual"
        ]
      ],
      "raw_glosses": [
        "(by extension) squirrel (any of the rodents of the family Sciuridae)"
      ],
      "tags": [
        "broadly"
      ]
    },
    {
      "categories": [
        "Tamil derogatory terms",
        "ta:Social media"
      ],
      "examples": [
        {
          "text": "Coordinate term: ஆமை (āmai)"
        }
      ],
      "glosses": [
        "a fan of actor Vijay"
      ],
      "links": [
        [
          "social media",
          "social media"
        ],
        [
          "derogatory",
          "derogatory"
        ],
        [
          "fan",
          "fan"
        ]
      ],
      "qualifier": "social media",
      "raw_glosses": [
        "(social media, chiefly derogatory) a fan of actor Vijay"
      ],
      "tags": [
        "derogatory"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/aɳil/"
    },
    {
      "audio": "Ta-அணில்.ogg",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/f/f1/Ta-%E0%AE%85%E0%AE%A3%E0%AE%BF%E0%AE%B2%E0%AF%8D.ogg/Ta-%E0%AE%85%E0%AE%A3%E0%AE%BF%E0%AE%B2%E0%AF%8D.ogg.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/f/f1/Ta-%E0%AE%85%E0%AE%A3%E0%AE%BF%E0%AE%B2%E0%AF%8D.ogg"
    }
  ],
  "word": "அணில்"
}

Download raw JSONL data for அணில் meaning in All languages combined (6.2kB)

{
  "called_from": "inflection/735",
  "msg": "inflection table: unrecognized header: 'genitive 1'",
  "path": [
    "அணில்"
  ],
  "section": "Tamil",
  "subsection": "noun",
  "title": "அணில்",
  "trace": ""
}

{
  "called_from": "inflection/735",
  "msg": "inflection table: unrecognized header: 'genitive 2'",
  "path": [
    "அணில்"
  ],
  "section": "Tamil",
  "subsection": "noun",
  "title": "அணில்",
  "trace": ""
}

{
  "called_from": "inflection/735",
  "msg": "inflection table: unrecognized header: 'locative 1'",
  "path": [
    "அணில்"
  ],
  "section": "Tamil",
  "subsection": "noun",
  "title": "அணில்",
  "trace": ""
}

{
  "called_from": "inflection/735",
  "msg": "inflection table: unrecognized header: 'locative 2'",
  "path": [
    "அணில்"
  ],
  "section": "Tamil",
  "subsection": "noun",
  "title": "அணில்",
  "trace": ""
}

{
  "called_from": "inflection/735",
  "msg": "inflection table: unrecognized header: 'sociative 1'",
  "path": [
    "அணில்"
  ],
  "section": "Tamil",
  "subsection": "noun",
  "title": "அணில்",
  "trace": ""
}

{
  "called_from": "inflection/735",
  "msg": "inflection table: unrecognized header: 'sociative 2'",
  "path": [
    "அணில்"
  ],
  "section": "Tamil",
  "subsection": "noun",
  "title": "அணில்",
  "trace": ""
}

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2025-06-18 from the enwiktionary dump dated 2025-06-01 using wiktextract (074e7de and f1c2b61). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.